UCT-0039 seq list.txt 

SEQUENCE LISTING 

<110> University of Connecticut 
Weller, Sandra K 
Bacher Reuven, Nina 
Myers, Richard S 

<120> Viral Recombinases, Related Articles and Methods of Use Thereof 

<130> UCT-0039 

<150> US 60/408,092 
<151> 2002-09-04 

<160> 6 

<170> PatentIn version 3.2 

<210> 1 

<211> 1881 

<212> DNA 

<213> Herpes simplex virus 1 

<300> 

<308> NC_001806 

<309> 2003-08-18 

<313> (1)..(1881) 

<400> 1 



atggagtcca cggtaggccc agcatgtccg ccgggacgca ccgtgactaa gcgtccctgg 60 

gccctggccg aggacacccc tcgtggcccc gacagccccc ccaagcgccc ccgccctaac 120 

agtcttccgc tgacaaccac cttccgtccc ctgccccccc caccccagac gacatcagct 180 

gtggacccga gctcccattc gcccgttaac cccccacgtg atcagcacgc caccgacacc 240 

gcagacgaaa agccccgggc cgcgtcgccg gcactttctg acgcctcagg gcctccgacc 300 

ccagacattc cgctatctcc tgggggcacc cacgcccgcg acccggacgc cgatcccgac 360 

tccccggacc ttgactctat gtggtcggcg tcggtgatcc ccaacgcgct gccctcccat 420 

atactagccg agacgttcga gcgccacctg cgcgggttgc tgcgcggcgt ccgcgcccct 480 

ctggccatcg gtcccctctg ggcccgcctg gattatctgt gttccctggc cgtggtcctc 540 

gaggaggcgg gtatggtgga ccgcggactc ggtcggcacc tatggcgcct gacgcgccgc 600 

gggcccccgg ccgccgcgga cgccgtggcg ccccggcccc tcatggggtt ttacgaggcg 660 

gccacgcaaa accaggccga ctgccagcta tgggccctgc tccggcgggg cctcacgacc 720 

gcatccaccc tccgctgggg cccccagggt ccgtgtttct cgccccagtg gctgaagcac 780 

aacgccagcc tgcggccgga tgtacagtct tcggcggtga tgttcgggcg ggtgaacgag 840 

ccgacggccc gaagcctgct gtttcgctac tgcgtgggcc gcgcggacga cggcggcgag 900 

gccggcgccg acacgcggcg ctttatcttc cacgaaccca gcgacctcgc cgaagagaac 960 

gtgcatacgt gtggggtcct catggacggt cacacgggga tggtcggggc gtccctggat 1020 

dtxc Lcg lc L 


g Lcc Lcggga 


cattcacggc tacctggccc 








gccu u L Ldcg 


agg uCadaLg 


ccgggccaag 


tacgctttcg 




1^ L. ^ k.ciy ^ y Ci V, 




cccdcggcc L 


ccgcg Lacga 


ggacttgatg 


gcacaccggt 


v.\.ui.yyayy I. 








CgaLCCCgaa 


gcccagcgtg 


cgatacttcg 


v.yLCv.yyyu.y 






ccggaggagg 


CXCLCgLCaC 


gcaagaccag 


gcctggtcag 


aggcccacgc 


ctcgggcgaa 




ddaagycyg^ 




ggatcgggcc 


ttggtggagt 


taaatagcgg 


cgttgtctcg 


1380 


gagg xgcT. lc 


T.y L L^yycy V. 


ccccgacctc 


ggacgccaca 


ccatctcccc 


cgtgtcctgg 


1440 


agctccgggg 


axcLggLCcg 


ccgcgagccc gtcttcgcga acccccgtca cccgaacttt 


1500 


aagcagatct tggtgcaggg ctacgtgctc gacagccact tccccgactg ccccccccac 1560 




ccgcatCLgg 


Lgacg L L xa L 


cggcaggcac 


cgcaccagcg 


cggaggaggg 


cgtaacgttc 


1620 


cgcctggagg acggcgccgg ggctctcggg gccgcaggac ccagcaaggc gtccattctc 1680 

ccgaaccagg ccgttccgat cgccctgatc attacccccg tccgcatcga tccggagatc 1740 

tataaggcca tccagcgaag cagccgcctg gcattcgacg acacgctcgc cgagctatgg 1800 

gcctctcgtt ctccggggcc cggccctgct gctgccgaaa caacgtcctc atcaccgacg 1860 

acggggaggt cgtctcgctg a 1881 



<210> 2 

<211> 626 

<212> PRT 

<213> Herpes simplex virus 1 

<300> 

<308> GI 119693 

<309> 1992-05-01 

<313> (1)..(626) 

<400> 2 

Met Glu Ser Thr Val Gly Pro Ala Cys Pro Pro Gly Arg Thr Val Thr 
1 5 10 15 

Lys Arg Pro Trp Ala Leu Ala Glu Asp Thr Pro Arg Gly Pro Asp Ser 
20 25 30 

Pro Pro Lys Arg Pro Arg Pro Asn Ser Leu Pro Leu Thr Thr Thr Phe 
35 40 45 

Arg Pro Leu Pro Pro Pro Pro Gln Thr Thr Ser Ala Val Asp Pro Ser 
50 55 60 

Ser His Ser Pro Val Asn Pro Pro Arg Asp Gln His Ala Thr Asp Thr 
65 70 75 80 
Ala Asp Glu Lys Pro Arg Ala Ala Ser Pro Ala Leu Ser Asp Ala Ser 
85 90 95 

Gly Pro Pro Thr Pro Asp Ile Pro Leu Ser Pro Gly Gly Thr His Ala 
100 105 110 

Arg Asp Pro Asp Ala Asp Pro Asp Ser Pro Asp Leu Asp Ser Met Trp 
115 120 125 

Ser Ala Ser Val Ile Pro Asn Ala Leu Pro Ser His Ile Leu Ala Glu 
130 135 140 

Thr Phe Glu Arg His Leu Arg Gly Leu Leu Arg Gly Val Arg Ala Pro 
145 150 155 160 

Leu Ala Ile Gly Pro Leu Trp Ala Arg Leu Asp Tyr Leu Cys Ser Leu 
165 170 175 

Ala Val Val Leu Glu Glu Ala Gly Met Val Asp Arg Gly Leu Gly Arg 
180 185 190 

His Leu Trp Arg Leu Thr Arg Arg Gly Pro Pro Ala Ala Ala Asp Ala 
195 200 205 

Val Ala Pro Arg Pro Leu Met Gly Phe Tyr Glu Ala Ala Thr Gln Asn 
210 215 220 

Gln Ala Asp Cys Gln Leu Trp Ala Leu Leu Arg Arg Gly Leu Thr Thr 
225 230 235 240 

Ala Ser Thr Leu Arg Trp Gly Pro Gln Gly Pro Cys Phe Ser Pro Gln 
245 250 255 

Trp Leu Lys His Asn Ala Ser Leu Arg Pro Asp Val Gln Ser Ser Ala 
260 265 270 

Val Met Phe Gly Arg Val Asn Glu Pro Thr Ala Arg Ser Leu Leu Phe 
275 280 285 

Arg Tyr Cys Val Gly Arg Ala Asp Asp Gly Gly Glu Ala Gly Ala Asp 
290 295 300 

Thr Arg Arg Phe Ile Phe His Glu Pro Ser Asp Leu Ala Glu Glu Asn 
305 310 315 320 

Val His Thr Cys Gly Val Leu Met Asp Gly His Thr Gly Met Val Gly 
325 330 335 
Ala Ser Leu Asp Ile Leu Val Cys Pro Arg Asp Ile His Gly Tyr Leu 
340 345 350 

Ala Pro Val Pro Lys Thr Pro Leu Ala Phe Tyr Glu Val Lys Cys Arg 
355 360 365 

Ala Lys Tyr Ala Phe Asp Pro Met Asp Pro Ser Asp Pro Thr Ala Ser 
370 375 380 

Ala Tyr Glu Asp Leu Met Ala His Arg Ser Pro Glu Ala Phe Arg Ala 
385 390 395 400 

Phe Ile Arg Ser Ile Pro Lys Pro Ser Val Arg Tyr Phe Ala Pro Gly 
405 410 415 

Val Pro Gly Pro Glu Glu Ala Leu Val Thr Gln Asp Gln Ala Trp 
420 425 430 

Ser Glu Ala His Ala Ser Gly Glu Lys Arg Arg Cys Ser Ala Ala Asp 
435 440 445 

Arg Ala Leu Val Glu Leu Asn Ser Gly Val Val Ser Glu Val Leu Leu 
450 455 460 

Phe Gly Ala Pro Asp Leu Gly Arg His Thr Ile Ser Pro Val Ser Trp 
465 470 475 480 

Ser Ser Gly Asp Leu Val Arg Arg Glu Pro Val Phe Ala Asn Pro Arg 
485 490 495 

His Pro Asn Phe Lys Gln Ile Leu Val Gln Gly Tyr Val Leu Asp Ser 
500 505 510 

His Phe Pro Asp Cys Pro Pro His Pro His Leu Val Thr Phe Ile Gly 
515 520 525 

Arg His Arg Thr Ser Ala Glu Glu Gly Val Thr Phe Arg Leu Glu Asp 
530 535 540 

Gly Ala Gly Ala Leu Gly Ala Ala Gly Pro Ser Lys Ala Ser Ile Leu 
545 550 555 560 

Pro Asn Gln Ala Val Pro Ile Ala Leu Ile Ile Thr Pro Val Arg Ile 
565 570 575 

Asp Pro Glu Ile Tyr Lys Ala Ile Gln Arg Ser Ser Arg Leu Ala Phe 
580 585 590 

Asp Asp Thr Leu Ala Glu Leu Trp Ala Ser Arg Ser Pro Gly Pro Gly 
595 600 605 

Pro Ala Ala Ala Glu Thr Thr Ser Ser Ser Pro Thr Thr Gly Arg Ser 
610 615 620 

Ser Arg 
625 

<210> 3 

<211> 4420 

<212> DNA 

<213> Herpes simplex virus 1 

<300> 

<308> M20165 

<309> 1994-04-19 

<313> (1)..(4420) 

<400> 3 

cggatccggg cggcgagctg ctgcgcggcg ccccggccgg cggcccggtt tattcgcgtc 60 

ggcccggccg gccgggctta tggaccgccg gcggccgaca ggagcgtgac gtagccggtg 120 

ggcgtggccg ctattataaa aaaagtgaga acgcgaagcg ttcgcacttt gtcctaataa 180 

tatatatatt attaggacaa agtgcgaacg cttcgcgttc tcactttttt tataatagcg 240 

gccacgccca ccggctgatg acgcgcgggg tgtgggaggg gctggggcgg tccggcacgc 300 

ccccaggtaa agtgtacata taccaaccgc atatcagacg cacccggccc ggagcacctg 360 

accgtaagca tctgtgcctc tcgcagggac cccgcgttgc cagccgccgg ggttcatcgg 420 

caccccgtgg ttacccgggg gttgtcggtg aagggtaggg attcattccc caaccccggt 480 

ctcccaccct ccccttgacc gtcgccgccc ccccccccgg attttgacgc tcgggagaca 540 

tacctcgtcg ggcgtccgtc gtcgtgccgg gattacctcc gtttgcggac cgattgccag 600 

gaggacatgg agacaaagcc caagacggca accaccatca aggtcccccc cgggcccctg 660 

ggatacgtgt acgctcgcgc gtgtccgtcc gaaggcatcg agcttctggc gttactgtcg 720 

gcgcgcagcg gcgatgccga cgtcgccgtg gcgcccctgg tcgtgggcct gaccgtggag 780 

agcggctttg aggccaacgt agccgtggtc gtgggttctc gcacgacggg gctcgggggt 840 

accgcggtgt ccctgaaact gacgccatcg cactacagct cgtccgtgta cgtctttcac 900 

ggcggccggc acctggaccc cagcacccag gccccaaacc tgacgcgact ctgcgagcgg 960 

gcacgccgcc attttggctt ttcggactac accccccggc ccggcgacct caaacacgag 1020 

acgacggggg aggcgctgtg tgagcgcctc ggcctggacc cggaccgcgc cctcctgtat 1080 

ctggtcgtta ccgagggctt caaggaggcc gtgtgcatca acaacacctt tctgcacctg 1140 
ggaggctcgg 


accidgy Lddc 


cataggcggg gcggaggtgc 



ciCL.ycci L<ii«^ 


n 1" n t* a 1" r* n 
v^yLyLdLCcy 


1200 



TtgcagcxgL 


LCaxgccgyd 


ttttagccgg 


gtcatcgccg 


ay^^y l lv.cici 


v*y^v.dd^(.dL. 


1260 



cgd Lcga Lcg 


ggyciyaaL L L 


tacctacccg 


cttccgtttt 


^r^r'>7i^ cc\c c r 
L Lciciv.^y^^v, 


\^\. L^adL.L.yc 


1320 




aggcggLcgL 


gggacccgcc 


gccgtggcac 


Ly \»yci Ly^v.y 


aaav.^ i.^^a^ 


1380 


gccguygccc 


gcgcgyctyc 


ccacctggcg 


tttgacgaaa 


ci\»^a^yayyy 




1440 


^ f n f F\ ^ ^3 

cccyC'V-gci^u 


LXaCy Ll.(.d.C. 


ggccttcgaa 


gccagccagg 




y^yyyy Lyyy 


1500 


cgcgacggcg 


gcggcaaygg 


cccggcgggc 


gggttcgaac 


ay ^y^k. uyy v. 


w laVa^^ L^a ^y 


1560 


gccggagacg 


ccgccc rggc 


cctcgagtct 


atcgtgtcga 


tyyccy uv. u l 


i.ydv.ydy v.^y 


1620 


cccaccgaca 


rc Lccgcg xg 


gccgctgtgc 


gagggccagg 


a.uciL.yy ^v.y u 


y y 1. V. L. y L.y i. L. 




aacgccgxcg 


gggcg La.cc L 


ggcgcgcgcc 


gcgggactcg 


Lyyyyy v.v.a l 


n n 1" a "t" 1" 1" 3 n r* 
yy ^ A L u Lay ^ 


1740 


aCCadC UCyy 


CCC LCCd. LC L 


caccgaggtg 


gacgacgccg 


y LL^uyyuyya 


rrraaaooar 


1800 


CaCagCaaaC 


r* r* 1- r* r 1" t- 1" 1- 
CCLCCLUULd, 


ccgcttcttc 


ctcgtgcccg 




yy ^yy ^^""^ 


1860 


ccacagg igg 


accgcgaggg 


acacgtggtg 


cccgggttcg 


«yyy Lcyycc 




1920 


ctcgtcggcg 


gaacccagga 


atttgccggc 


gagcacctgg 


ccd Ly V. Ly Ly 


1" n n n t" t* 1* "t* r 
LyyyLLLL^v> 




ccggcgctgc 


nggccaagaL 


gctgttttac 


ctggagcgct 


gcgacggcgg 


L.y Lyd Li..y lv. 


2040 


gggcgccagg 


agarggacgr 


gtttcgatac 


gtcgcggact 


V V. da L. V. cty d 


v>ydi.y L.y V.I..V. 


2100 


Lgcaaccxg L 


gcacc LLcgci 


cacgcgccac 


gcctgcgtac 


dL.dv.yav.y ^ l 


v.a Ly v.y L< w< ll. 


2160 


cgggcgcgcc 


auCCCaay L U 


cgccagcgcc 


gcccgcggag 


^v.a Luyy ^y l 


L. L LV-yyyav.^ 


2220 


atgaacagca 


xg xacaycga 


ctgcgacgtg 


ctgggaaact 


dv.y^i..y^^ L L 


L. LL.yyL.L.L. Ly 


2280 



aagcgcgcgg 


dcgya LCCyd 


gaccgcccgg 


accatcatgc 


ayyaya\.y ua 


^'-y ^y ^yy 


2340 


accgagcgcg 


xcaLggccga 


actcgagacc 


ctgcagtacg 


Lyydccdyyu 


yy Li..\.v.^a^^ 


2400 


gccatggggc 


M M M M ^ « "5 ^ 

ggctggagac 


catcatcacc 


aaccgcgagg 


CL.L.LyCdLdV. 


yy ^yy LycicLL. 


2460 


aacgtcaggc 


aggtcgxgga 


ccgcgaggtg 


gagcagctga 


LycycddCv. L 


gyxggdygyy 


2520 


aggaacrrca 


<^ +" +■ ^ M n 0 

ag L L Lcgcga 


cggtctgggc 


gaggccaacc 


d^y^v.d Ly 


L<L. Lya^y v« Ly 


2580 


gacccgxacg 


cgxgcgggcc 


atgccccctg 


cttcagcttc 


xcggytyyi-y 


a LL.L.aav.\M l^ 


2640 


M m4*m^ ^ ^ ^ 

gccgtgtatc 


aggaccxgyc 


cctgagccag 


tgccacgggg 


i>y L Lv.y^^yy 


yvay LL.yy ll* 


2700 



gaggggcgca 


acx L Lcgcaa 


tcaattccaa 


ccggtgctgc 


gg^gg^g^yi- 


yd Lyy aL.a Ly 


2760 


tttaacaacg 


ggtttctgtc 


ggccaaaacg 


ctgacggtcg 


cycxcu(.yyd 


gggygcggcx 


2820 


atctgcgccc ccagcctaac ggccggccag acggcccccg ccgagagcag cttcgagggc 2880 

gacgttgccc gcgtgaccct ggggtttccc aaggagctgc gcgtcaagag ccgcgtgttg 2940 

ttcgcgggcg cgagcgccaa cgcgtccgag gccgccaagg cgcgggtcgc cagcctccag 3000 

agcgcctacc agaagcccga caagcgcgtg gacatcctcc tcggaccgct gggctttctg 3060 
L V. V.CI v.y y ^ 


\mCi \m\m\. \m\m\m\m\m 


dav.y yi^ddyi. 


v.L.L.L.yyyy ll- 


L.aav.^ay L. v.y 


3120 




yy L L v> i.yy 




y v> a CL %■ %> ci y Vm 


L L\v\vLiy 




3180 




Lv.yGiycii.^ci L 


r fi r* n 1" IT' a *t" 1" 


a a a a a n1"1"1"1" 
aciciciay l l l l 


rr f"f" n n a f" "t* a 

LyyCIl- LCi 


v»yy Va^^^a ua 


3240 


cLaLX^ La L Lo. 


d L. L. I. y y 1* L. ^ 


paar"aar"r!^ri 
udd^ddv.y Ly 


dy v-ydyL. Lyy 


L.yoi Ly LciL. La 


f"a^nnr'aaar' 
L.a Lyy v.aaai.. 


3300 




yy LdL uycyd 


^far'^rriar'a 
LL.dL. L v.ydv.d 


1"ar"f"1"r*a1"r*a 

LdL. L LL.dL(i.d 


a r*a ccfYcTk c 


nnrra "t* fa IT" 
yy L.v.d LL.d LL. 


3360 


9C9999LCCC 


r\ r' rA'^' f~ f~ f t~ 

y Lcy Lccccc 


^dyuy uyudy 


gcggcggccg 


^-y v-yy un-yt. 


gcdgyycygy 


3420 


gcgggcctgg 


aggccggggc 


ccy cy c.y c Ly 


d Lyydcy L.L.y 


Ly ydL.y L.y L.d 


tL-cggycgcg 


3480 


Lyydcy LCCa 


Ly L LcgLLdy 


L. Ly cddcv, Ly 


CLguggcccg 


LLid Lyy L.yy L. 


yL,yi«L.L.L.dLy 


3540 


gLcy Lg L xgy 


yy L Lydy L.d l 


a n r^a a a t" a 
L.dy L.ddd Ldv. 


LdL.yy L.d Lyy 


L.v.yy L>daL<ya 


ff n 1" n "f" n 1" 1" 1" 
L<L.y Ly Ly l l l 


3600 


i.dyyu\.yyyd 


du Lyyy^i^dy 


L.L. Lyd Lyyyi. 


nnr'aaaaaf n 
yy v.ciciGioiciv.y 


L.y LyL.v.L.yL. l 


v>%K !• La 1. 1, u 1. 1. 


3660 


ydL.L.yL.dCv.L. 


yLddy I, Lv.y l 


Lyy L.V. Ly L 


L. L. L. L. y y y L. L. y 


yy L L Ly Ly Ly 


L.y L>y y ^L. LL.y 


3720 


ddcc Lcgycy 


gcyy dy cy v.d 


L.ydddy L. LL.y 


L. Ly LyL.ydyL. 


dyL. LL.L.yyyy 


L.dLLdLL.Ll.L. 


3780 


gagggcgggg 


cggccg Lcgc 


L.dy Ldyuy Ly 


L LL.y LyyL.yd 


rTTitriaaaafi 
L.L.y Lyddaay 


LyyyyL.L.L. 


3840 


CyCdLCCdy V, 


dycLycdydL 


L.ydyydi- Lyy 


L. Lyy L.y L. ll.l. 


^nnannarria 
Lyy ay yaL.y a 


nl" a ffl" a a n f 


3900 


gdy ydydT^yd 


Lyy dy u LydL. 


L.yL.yL.y LycL. 


L. Lyyciy L.y L.y 


yLiCiaL.y y L.y d 


y ^yy LL-ycu-y 


3960 


yduy i.y y ^v.^ 


"tTinanntTin/* 

^gg^9g»-gg^ 


yL.dL.ydyycL. 


yciyy Lciy 


Lv.dyLiV.ddL.L 




4020 


ggggsggLgi: 


T-T-ddC L L Xyy 


yydT. LT.T.yy L, 


Lgcydgydcy 


dL,ddL.y L.y dL. 


yi^uy L L(.yy\. 


4080 


ggcccggggg 


L. L. y y y d Li 


yycdL LLyi.L. 


nnr"i"nr"aaar' 
yy L.L.y L.dddL. 


yyyL.y l LL.L.d 


C C\C\C\C\7K't' C\7\C 

^yyyya i.^a^ 


4140 


ccgtttgggg aggggccccc cgacaaaaag ggagacctga cgttggatat gctgtgaggg 4200 

gttggggggt gggggaacct agggcggggc ggggaatgtg tgtaaaataa attattgcta 4260 

cgacatccgt gcttgtttgt gttccgtgtc tatatctctg ggcgggccgt gattcctctc 4320 

cgcggtgtct gggaatagaa cagaaacgca cgcgccgccg actcccggct tgccggtcgg 4380 

cgggcccgcg ggaggccgcc ccgaagaggg ggaccccggg 4420 



<210> 4 

<211> 1196 

<212> PRT 

<213> Herpes simplex virus 1 
<300> 

<308> M20165 

<309> 1994-04-19 

<313> (1)..(1196) 

<400> 4 

Met Glu Thr Lys Pro Lys Thr Ala Thr Thr Ile Lys Val Pro Pro Gly 
1 5 10 15 
Pro Leu Gly Tyr Val Tyr Ala Arg Ala Cys Pro Ser Glu Gly Ile Glu 
20 25 30 

Leu Leu Ala Leu Leu Ser Ala Arg Ser Gly Asp Ala Asp Val Ala Val 
35 40 45 

Ala Pro Leu Val Val Gly Leu Thr Val Glu Ser Gly Phe Glu Ala Asn 
50 55 60 

Val Ala Val Val Val Gly Ser Arg Thr Thr Gly Leu Gly Gly Thr Ala 
65 70 75 80 

Val Ser Leu Lys Leu Thr Pro Ser His Tyr Ser Ser Ser Val Tyr Val 
85 90 95 

Phe His Gly Gly Arg His Leu Asp Pro Ser Thr Gln Ala Pro Asn Leu 
100 105 110 

Thr Arg Leu Cys Glu Arg Ala Arg Arg His Phe Gly Phe Ser Asp Tyr 
115 120 125 

Thr Pro Arg Pro Gly Asp Leu Lys His Glu Thr Thr Gly Glu Ala Leu 
130 135 140 

Cys Glu Arg Leu Gly Leu Asp Pro Asp Arg Ala Leu Leu Tyr Leu Val 
145 150 155 160 

Val Thr Glu Gly Phe Lys Glu Ala Val Cys Ile Asn Asn Thr Phe Leu 
165 170 175 

His Leu Gly Gly Ser Asp Lys Val Thr Ile Gly Gly Ala Glu Val His 
180 185 190 

Arg Ile Pro Val Tyr Pro Leu Gln Leu Phe Met Pro Asp Phe Ser Arg 
195 200 205 

Val Ile Ala Glu Pro Phe Asn Ala Asn His Arg Ser Ile Gly Glu Asn 
210 215 220 

Phe Thr Tyr Pro Leu Pro Phe Phe Asn Arg Pro Leu Asn Arg Leu Leu 
225 230 235 240 

Phe Glu Ala Val Val Gly Pro Ala Ala Val Ala Leu Arg Cys Arg Asn 
245 250 255 

Val Asp Ala Val Ala Arg Ala Ala Ala His Leu Ala Phe Asp Glu Asn 
260 265 270 
His Glu Gly Ala Ala Leu Pro Ala Asp Ile Thr Phe Thr Ala Phe Glu 
275 280 285 

Ala Ser Gln Gly Lys Thr Pro Arg Gly Gly Arg Asp Gly Gly Gly Lys 
290 295 300 

Gly Pro Ala Gly Gly Phe Glu Gln Arg Leu Ala Ser Val Met Ala Gly 
305 310 315 320 

Asp Ala Ala Leu Ala Leu Glu Ser Ile Val Ser Met Ala Val Phe Asp 
325 330 335 

Glu Pro Pro Thr Asp Ile Ser Ala Trp Pro Leu Cys Glu Gly Gln Asp 
340 345 350 

Thr Ala Ala Ala Arg Ala Asn Ala Val Gly Ala Tyr Leu Ala Arg Ala 
355 360 365 

Ala Gly Leu Val Gly Ala Met Val Phe Ser Thr Asn Ser Ala Leu His 
370 375 380 

Leu Thr Glu Val Asp Asp Ala Gly Pro Ala Asp Pro Lys Asp His Ser 
385 390 395 400 

Lys Pro Ser Phe Tyr Arg Phe Phe Leu Val Pro Gly Thr His Val Ala 
405 410 415 

Ala Asn Pro Gln Val Asp Arg Glu Gly His Val Val Pro Gly Phe Glu 
420 425 430 

Gly Arg Pro Thr Ala Pro Leu Val Gly Gly Thr Gln Glu Phe Ala Gly 
435 440 445 

Glu His Leu Ala Met Leu Cys Gly Phe Ser Pro Ala Leu Leu Ala Lys 
450 455 460 

Met Leu Phe Tyr Leu Glu Arg Cys Asp Gly Gly Val Ile Val Gly Arg 
465 470 475 480 

Gln Glu Met Asp Val Phe Arg Tyr Val Ala Asp Ser Asn Gln Thr Asp 
485 490 495 

Val Pro Cys Asn Leu Cys Thr Phe Asp Thr Arg His Ala Cys Val His 
500 505 510 

Thr Thr Leu Met Arg Leu Arg Ala Arg His Pro Lys Phe Ala Ser Ala 
515 520 525 

Ala Arg Gly Ala Ile Gly Val Phe Gly Thr Met Asn Ser Met Tyr Ser 
530 535 540 

Asp Cys Asp Val Leu Gly Asn Tyr Ala Ala Phe Ser Ala Leu Lys Arg 
545 550 555 560 

Ala Asp Gly Ser Glu Thr Ala Arg Thr Ile Met Gln Glu Thr Tyr Arg 
565 570 575 

Ala Ala Thr Glu Arg Val Met Ala Glu Leu Glu Thr Leu Gln Tyr Val 
580 585 590 

Asp Gln Ala Val Pro Thr Ala Met Gly Arg Leu Glu Thr Ile Ile Thr 
595 600 605 

Asn Arg Glu Ala Leu His Thr Val Val Asn Asn Val Arg Gln Val Val 
610 615 620 

Asp Arg Glu Val Glu Gln Leu Met Arg Asn Leu Val Glu Gly Arg Asn 
625 630 635 640 

Phe Lys Phe Arg Asp Gly Leu Gly Glu Ala Asn His Ala Met Ser Leu 
645 650 655 

Thr Leu Asp Pro Tyr Ala Cys Gly Pro Cys Pro Leu Leu Gln Leu Leu 
660 665 670 

Gly Arg Arg Ser Asn Leu Ala Val Tyr Gln Asp Leu Ala Leu Ser Gln 
675 680 685 

Cys His Gly Val Phe Ala Gly Gln Ser Val Glu Gly Arg Asn Phe Arg 
690 695 700 

Asn Gln Phe Gln Pro Val Leu Arg Arg Arg Val Met Asp Met Phe Asn 
705 710 715 720 

Asn Gly Phe Leu Ser Ala Lys Thr Leu Thr Val Ala Leu Ser Glu Gly 
725 730 735 

Ala Ala Ile Cys Ala Pro Ser Leu Thr Ala Gly Gln Thr Ala Pro Ala 
740 745 750 

Glu Ser Ser Phe Glu Gly Asp Val Ala Arg Val Thr Leu Gly Phe Pro 
755 760 765 
Lys Glu Leu Arg Val Lys Ser Arg Val Leu Phe Ala Gly Ala Ser Ala 
770 775 780 

Asn Ala Ser Glu Ala Ala Lys Ala Arg Val Ala Ser Leu Gln Ser Ala 
785 790 795 800 

Tyr Gln Lys Pro Asp Lys Arg Val Asp Ile Leu Leu Gly Pro Leu Gly 
805 810 815 

Phe Leu Leu Lys Gln Phe His Ala Ala Ile Phe Pro Asn Gly Lys Pro 
820 825 830 

Pro Gly Ser Asn Gln Pro Asn Pro Gln Trp Phe Trp Thr Ala Leu Gln 
835 840 845 

Arg Asn Gln Leu Pro Ala Arg Leu Leu Ser Arg Glu Asp Ile Glu Thr 
850 855 860 

Ile Ala Phe Ile Lys Lys Phe Ser Leu Asp Tyr Gly Ala Ile Asn Phe 
865 870 875 880 

Ile Asn Leu Ala Pro Asn Asn Val Ser Glu Leu Ala Met Tyr Tyr Met 
885 890 895 

Ala Asn Gln Ile Leu Arg Tyr Cys Asp His Ser Thr Tyr Phe Ile Asn 
900 905 910 

Thr Leu Thr Ala Ile Ile Ala Gly Ser Arg Arg Pro Pro Ser Val Gln 
915 920 925 

Ala Ala Ala Ala Trp Ser Ala Gln Gly Gly Ala Gly Leu Glu Ala Gly 
930 935 940 

Ala Arg Ala Leu Met Asp Ala Val Asp Ala His Pro Gly Ala Trp Thr 
945 950 955 960 

Ser Met Phe Ala Ser Cys Asn Leu Leu Arg Pro Val Met Ala Ala Arg 
965 970 975 

Pro Met Val Val Leu Gly Leu Ser Ile Ser Lys Tyr Tyr Gly Met Ala 
980 985 990 

Gly Asn Asp Arg Val Phe Gln Ala Gly Asn Trp Ala Ser Leu Met Gly 
995 1000 1005 

Gly Lys Asn Ala Cys Pro Leu Leu Ile Phe Asp Arg Thr Arg Lys 
1010 1015 1020 
Phe Val Leu Ala Cys Pro Arg Ala Gly Phe Val Cys Ala Ala Ser 
1025 1030 1035 

Asn Leu Gly Gly Gly Ala His Glu Ser Ser Leu Cys Glu Gln Leu 
1040 1045 1050 

Arg Gly Ile Ile Ser Glu Gly Gly Ala Ala Val Ala Ser Ser Val 
1055 1060 1065 

Phe Val Ala Thr Val Lys Ser Leu Gly Pro Arg Thr Gln Gln Leu 
1070 1075 1080 

Gln Ile Glu Asp Trp Leu Ala Leu Leu Glu Asp Glu Tyr Leu Ser 
1085 1090 1095 

Glu Glu Met Met Glu Leu Thr Ala Arg Ala Leu Glu Arg Gly Asn 
1100 1105 1110 

Gly Glu Trp Ser Thr Asp Ala Ala Leu Glu Val Ala His Glu Ala 
1115 1120 1125 

Glu Ala Leu Val Ser Gln Leu Gly Asn Ala Gly Glu Val Phe Asn 
1130 1135 1140 

Phe Gly Asp Phe Gly Cys Glu Asp Asp Asn Ala Thr Pro Phe Gly 
1145 1150 1155 

Gly Pro Gly Ala Pro Gly Pro Ala Phe Ala Gly Arg Lys Arg Ala 
1160 1165 1170 

Phe His Gly Asp Asp Pro Phe Gly Glu Gly Pro Pro Asp Lys Lys 
1175 1180 1185 

Gly Asp Leu Thr Leu Asp Met Leu 
1190 1195 

<210> 5 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> M13 DNA primer 

<400> 5 

gtcggtgacg gtgataattc acctttaatg 

<210> 6 
<211> 30 
<212> DNA 

<213> Artificial 

<220> 

<223> M13 DNA primer 

<400> 6 

cattaaaggt gaattatcac cgtcaccgac 